Factorio, a complex video game centered around building and resource management, has emerged as a novel tool for researchers to evaluate artificial intelligence capabilities. The game allows for testing the abilities of language models in planning and constructing complex systems while managing multiple resources and production chains. To this end, a research team developed a system called the "Factorio Learning Environment" (FLE), offering two distinct testing modes. The "Experiment Mode" contains 24 structured challenges with specific goals and limited resources, with tasks ranging from simple two-machine setups...